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Abstract —Due to the fundamental tradeoffs, achieving spec¬ 
trum efficiency and energy efficiency are two contending design 
chaflenges for the future wireless networks. However, applying 
radio-frequency (RF) energy harvesting (EH) in a cognitive radio 
system could potentially circumvent this tradeoff, resulting in a 
secondary system with limitless power supply and meaningful 
achievable information rates. This paper proposes an online 
solution for the optimal time allocation (time sharing) between 
the EH phase and the information transmission (IT) phase in 
an underlay cognitive radio system, which harvests the RE 
energy originating from the primary system. The proposed online 
solution maximizes the average achievable rate of the cognitive 
radio system, subject to the e-percentile protection criteria for 
the primary system. The optimal time sharing achieves significant 
gains compared to equal time allocation between the EH and IT 
phases. 

Index Terms —RF energy harvesting, underlay spectrum shar¬ 
ing, cognitive radio 

I. Introduction 

Energy harvesting (EH) is a highly promising technol¬ 
ogy which can support continuous power supply to energy- 
constrained communication system, such as machine-to- 
machine and sensor networks. Usually the main goal of the EH 
communication systems is to maximize the system through¬ 
put (i.e., the average achievable rate) under some causality 
constraints ID, El, El- The harvested energy can originate 
from the far-held radio frequency (RE) radiation of another 
device (e.g., a closely located communications transmitter 
which can transfer power and information simultaneously), or 
from the ambient RE radiation. Eor example, a shows that, if 
a mobile phone transmits with power of 0.5W, an EH device 
at distances of 1, 5, and 10 meters can harvest as much as 
40mW, 1.6mW, and 0.4mW, respectively. If a source is used 
for simultaneous information and power transfer, there exists a 
fundamental tradeoff between the amount of transferred energy 
and the achievable information rates (known as the rate-energy 
tradeoff) due to the infeasibility of the receiver to extract 
information and energy simultaneously from the same signal 
0 . 

If an energy-constrained system does not have dedicated 
communication resources (e.g., dedicated bandwidth), it can 
still be implemented as a secondary user (SU) system in a 
cognitive radio (CR) setup E). The CR system utilizes one of 
the following three methods for spectrum sharing: interweave 


(i.e. opportunistic), underlay and overlay spectrum sharing 0. 
In the interweave approach the CR devices perform spectrum 
sensing to detect the unoccupied incumbent bands and then 
access the channel in this band. In the underlay approach, the 
CR devices operate in the same band as the primary users 
(PUs), and are allowed to interfere with the primary system 
under some predefined threshold (e.g., based upon the criterion 
for the e-percentile protection of the primary system 0, 0). 
The overlay sharing approach differs from the opportunis¬ 
tic and underlay approaches, because the former approach 
envisions cooperation between the PU and SU systems. In 
the overlay CR systems, both the PU and SU systems adapt 
their transmissions in order to accommodate each other at 
the expense of increased complexity, whereas, in the case of 
the opportunistic and underlay spectrum sensing, the primary 
system is unaware of the secondary one. Actually, the underlay 
spectrum sharing is the simplest method and the easiest to 
implement in practice. 

In this paper, we study the underlay CR systems with RF 
energy harvesting capabilities at the secondary transmitter. 
Using the practically limitless but free power supply from 
the far-held RF radiation, the secondary system can main¬ 
tain continuous information How with meaningful achievable 
information rates. More specihcally, we focus on the scenario 
where the SU and PU systems co-exist in the same commu¬ 
nications band, and additionally the PU transmitter, during 
its normal communication with the PU receiver, also serves 
as a source of RF energy to the SU transmitter. Actually, 
the secondary transmitter exploits the signal coming from the 
PU system, which is considered to be harmful interference 
in the non-EH secondary systems. We aim at maximizing the 
performance of the RE energy harvesting, subject to the e- 
percentile protection criterion of the primary system. To the 
best of the authors’ knowledge, this scenario has not been 
considered in the literature. The paper proposes an online 
solution for the optimal time resource allocation between 
the EH and information transmission (IT) phases in the SU 
system. The proposed online solution is designed to maximize 
the SU average achievable rate while avoiding interfering with 
the PU system but requires the instantaneous CSI of several 
fading channels of the primary and secondary systems at the 
SU transmitter. 

The reminder of the paper is structured as follows. Section 


II presents the recent related works in the area. Section III 
elaborates the proposed and adopted CR system mode. Section 
IV presents the proposed online solution. Section V focuses on 
the performance analysis the proposed online solution. Section 
VI concludes the paper. 

II. Related Work 

Most of the papers that combine CR and EH are focusing 
on scenarios where the CR systems perform energy harvesting 
from outside sources other than RF sources or energy harvest¬ 
ing from ambient RF radiation. So far, none of the exiting 
works have studied the scenario where the CR systems in 
underlay sharing mode harvest the RF energy originating from 
the primary system. 

Moreover, the majority of the related works focus on the 
case of opportunistic spectrum access, where the CR system 
performs the RF energy harvesting and information trans¬ 
mission on separate channels (frequency bands). In ifTOll . the 
authors investigate SU devices that can switch between the 
opportunistic spectrum access and RF energy harvesting, and 
develop an optimal mode selection policy that maximizes the 
averaged SU system rate by balancing between the instan¬ 
taneous rate and the amount of harvested RF energy. The 
work in ca focuses on an opportunistic SU system that 
performs energy harvesting from outside sources (but not RF 
sources). More specifically, determines both the sensing 
duration and sensing threshold that maximize the average rate 
of the SU system for a given amount of harvested energy. 
The paper ifT^ studies a channel selection policy in a multi¬ 
channel opportunistic CR system, in which the SUs can select 
a specific frequency band either for data transmission or for 
RF energy harvesting from occupied channels. The papers 1(131 
and IH extend the system model from llT2l by introducing 
incomplete CSI at the SUs. In llT3l the optimal channel selec¬ 
tion policy is obtained by considering errors in the spectrum 
sensing process. The authors in l(T4l propose an online learning 
algorithm that obtains cognitive radio related environmental 
parameters, like channel availability, interference level etc. 
The proposed algorithm is based on the Markov decision 
process and uses the environmental observations for adjusting 
the channel selection, thus maximizing the secondary system’s 
throughput. 

In ifTsl and ifThl . the authors extend the system model 
of the former papers by introducing stochastic geometric 
approaches for performance assessment of CR systems with 
opportunistic spectrum sharing and RF energy harvesting. The 
paper lITSl considers a scenario where both the RF-powered 
SUs and the PUs follow independent homogeneous Poisson 
Point Processes (PPP) and communicate with their intended 
receivers at fixed distances. Moreover, it is assumed that 
each PU device randomly accesses the spectrum with a given 
probability and that each transmitting PU is centered at a guard 
zone and a harvesting zone that is inside the guard zone. 
Each SU transmitter either harvests the energy of a nearby 
PU transmitter if it lies in its harvesting zone, transmits if it is 
outside the guard zones of the PU transmitters, or it stays idle 


otherwise. The authors derive the optimal transmit power and 
SU density that maximize the secondary system’s rate, subject 
to the outage constraints of both PU and SU systems. In ifT^ . 
the authors investigate a CR based device-to-device (D2D) 
communication capable of harvesting the ambient interference 
resulting from the concurrent macro base station transmissions. 
After harvesting sufficient energy, the CR transmitters op¬ 
portunistically access a predefined nonexclusive channel that 
satisfies the interference constraints of the CR and macro cell 
communication. 

In the case of overlay spectrum sharing, lEl proposes a 
novel approach applicable only to overlay systems, denoted as 
the energy-information cooperation. The PU network provides 
both spectrum and energy to the SU network so that the SU 
network will aid (in a relaying fashion) the PU transmission 
in return. Assuming availability of non-causal PU informa¬ 
tion, the proposed energy-information cooperation can achieve 
substantial performance gain compared to the conventional 
information cooperation only. However, the system model 
assumptions, regarding the full PU-SU cooperation and non- 
causal information at the SUs limit the practical applicability 
of the approach. 

An underlay CR system with RF energy harvesting has not 
been studied so far. 

HI. System model 

In our considered underlay CR system, the SU is a point-to- 
point communication system, and consists of an EH transmitter 
{EH — Ts) and a secondary receiver (Rs)- The PU is also 
a point-to-point communication system, and consists of a 
primary transmitter (Tp) and a primary receiver (Rp). The 
EH — Ts is equipped with a rechargeable battery, which 
is recharged by harvesting the RF energy in the common 
communication channel (i.e., the common frequency band). 
Without any loss of generality, the EH — Ts battery capacity is 
assumed to be large enough to accumulate the entire amount of 
radiated PU energy per communication slot. For example, the 
radiated power from a DVB-T transmitter with Pp = lOkW 
at a distance of 100m is approximately 1.1 mW (assuming 
Free Space Path Loss). This is significantly lower than the 
(power) capacity of conventional mobile device batteries that 
range in the magnitude of several Watts. Apart from being an 
information source for the Rp, and an RF energy source for 
the EH — Ts, the RF energy radiated from the Tp is also 
a cause of interference for the Rs- The considered cognitive 
system is depicted in Fig. [T] 

It is assumed that the system operates in a block fading 
environment, where the channel gains remain constant during 
one channel block of duration T, but change from one block 
to the other. The time in the secondary system is divided into 
slots of equal duration, and each time slot lasts exactly one 
channel block of duration T. 

The total number of considered times slots is M. In the 
time slot, we define the following fading power gains (depicted 
in Fig. [T]i: the gain of the EH — Ts to Rs channel is denoted 
by Xi, the gain of the Tp to Rs channel by yi, the gain of the 



where Pi represents the transmit power of the EH—Ts in time 
slot i, and jth is the maximum tolerable interference power 
from EH — Ts at the Rp (denoted as the PUI threshold). 

Assuming ideal harvesting and conversion efficiencies, the 
energy harvested by EH — Ts in the EH phase of slot i is 
given hy Ei = {1 — ai)giPT- Consequently, average transmit 
power from the EH — Ts during the IT phase of slot i is given 
by Pi = Ei/ai = (1 — ai)giPT/{cti)- Hence, ([T]i is rewritten 
as: 


Fig. 1. System model 


Tp to EH — Ts by gi, and the gain of the EH — Ts to Rp 
by Zi. The power of the additive Gaussian noise at the Rs is 
We assume that the Tp continuously transmits with fixed 
output power Pp. 

During each time slot, the secondary system operation is 
divided in two phases: an EH phase and an information 
transmission (IT) phase. In the EH phase, EH — Ts harvests 
the RE radiation from Tp, whereas it transmits during the IT 
phase. However, EH — Ts adapts the duration of the phases 
from one slot to the other. In time slot, the duration of the 
EH phase is (1 — ai)T, and the duration of the subsequent 
IT phase is a^T, where ai is the time-sharing parameter. As 
seen in the following section, the time-sharing parameter 
(0 < < 1) is actually our optimization variable. 

In the rest of the paper, we assume a normalized unit block 
time (T = 1) without loss of generality, so we use the terms 
of energy and power interchangeably. Note, we assume that 
during the IT phase, the EH—Ts completely drains the energy 
harvested in its battery during the preceding EH phase. 

In the beginning of each time slot, the secondary system 
is assumed to have the instantaneous CSI available for the 
channels: xp t/p gi, Zi i.e. it knows the referred channel power 
gains before the SU’s transmission. In practice the channel 
gain Xi can be estimated prior to the transmission, either via 
a training sequence or in the uplink period if a time division 
duplex (TDD) scheme is used (assuming channel reciprocity). 
Due to the constant and known PU transmit power Pp the 
channel gain gi can be estimated by the EH — Ts, prior to 
the secondary communication, exploiting the primary system 
training sequence. The channel gain yi can be estimated by the 
Rs (in the same fashion as for channel gain gi) and reported 
to the EH — Ts- Einally, the channel gain Zi, being the most 
complex one to estimate and fed back, can be either obtained 
from a possible PU-SU cooperation, or in the PU uplink period 
if assuming a TDD scheme for the PU system. 

A. Primary system constraints 

We adopt e-percentile protection criteria for the primary 
system which means that the outage probability of the 

primary system caused by the interference from the EH — Ts 
transmission does not exceed the e threshold: 


( 1 ) 
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< e. 

( 2 ) 


In order to express dU in terms of a statistical or a time 
average, we introduce the indicator function as follows: 


I{ai) 


0 , 

1 , 


ai > 


fliPTZj 

ith+giPT^i 

otherwise 


(3) 


Now, © can be rewritten as E[J(ai)] < e , where E [•] 
denotes the expectation. 


IV. Optimal time resource allocation 

Given the available transmit power Pi in the beginning of 
IT phase in slot i, the achievable rate of the secondary system 
during the IT phase is given by na: 

= azlog2 (l + , 

where Si is expressed as: 


gjPrXj 
y,Pp + 0-2 ■ 


(5) 


The rate © can be achieved by Gaussian distributed code¬ 
words transmitted by EH — Ts and reliably decoded by Rs 
at the end of slot i. 

We now define the following optimization problem (PI): 


maxi? 

cti 

s.t. Pr {PiZi >ith} < e, (6) 

0 < di < 1, 

where T = E [i?] is the average achievable rate (i.e., expec¬ 
tation over all fading channel states). Eor sufficiently high 
number of time slots M, such that M ^ oo, PI can be 
equivalently written as (P2): 


M 

max 

“i.V* i=i 

M (PS 

s-t-ii E I («i) < e. 

i=l 

0 < tti < 1, 

where Ri is given by ®. Note, the constraint in 0 is the 
PU protection criterion expressed in terms of the indicator 
function ©. We now apply the Lagrange duality method ISl, 


Vv{PiZi > ith} < e, 










then introduce the Lagrange multiplier A associated with the 
constraint in and then decouple (P2) into parallel sub¬ 
problems (one for each fading state), which yields: 


max (Ri — XI (ofi)). (8) 

0<ai<l 

A. Optimal solution 

The optimization problem P2 is a convex optimization 
problem, because the objective function is a concave function 
(a:log(l + 1/a;) is concave), whereas the constraint is a step 
function lfT9l . In the cases when I {at) — 0, the optimal ai 
that maximizes the achievable rate for a particular fading state 
i, dUl is solved similarly as [18, Eqs. (38) and (40)], yielding; 


ctii — 


'S'i + Zg — 1 


where zg > 1 is the unique solution to the equation; 


(9) 


z\nz — z — Si+\ = 0. (10) 


However, if the primary link outage constraint is violated, 
which occurs when e [0, g^PTZi/{ q^Rtz^ + "fth)], the 
optimal solution must guarantee that PU outages do not occur 
while the rate is as high as possible (Fig. 3 in [18]), thus: 


gjPrZj 
giPTZi + 7t/i 


( 11 ) 


Since the outage probability threshold in the PU system is 
set to be e, we can tolerate outages in 100-e percent of the time 
in order to maximize the long-term average achievable rate 
(i.e., over M ^ oo time slots). This threshold can be achieved 
by the appropriate selection of the Lagrange multiplies A that 
appears in ®. It can be determined numerically such that the 
constraint in © is satisfied. In a practical system, A can be 
computed during the system training phase and then it can be 
used for online determination of the time-sharing parameter 
tti, as according to the solution of ([8]l, which is given by: 



Fig. 2. Illustration of the optimization problem 


Case 2: < a^ 2 - 1^ this event, there are two sub-cases 

depending on the values of an and 0 ^ 2 - More specifically, 
if < Ri 2 than the optimal value of the time resource 
allocation should be equal to ai 2 (Fig. Eh). If Rn > R^2, 
than the optimal value an maximizes the achievable rate of 
the penalty introduced by the A parameter (Fig. El)- 

V. Average achievable rates 

This section analyzes the average achievable rates of the 
considered CR system with RF energy harvesting. The rates 
are determined in function of the average harvested energy, 
the channel gain, the outage probability threshold and the PUI 
threshold. For this purpose, we use Monte Carlo simulations. 
The simulation setup is presented in Table U 

TABLE I 

Simulation SETUP 


Parameter types 

Parameter values 

Simulation environment 

Matlab 

Number of time slots per simulation run {M) 

10000 

Channel model 

Rayleigh fading 

PU outage probability (e) 

from 0.01 to 0.99 

Noise power {<j^) 

-90dBm 


{ an, l{a^i) = 0 

an, I{an) = l and R^^> Ra , (12) 

o ^^2 I (an) = 1 and R^-^ < R^ 

where Rn and Ri 2 are defined as: 

Pii = /(«ii) 

R^2 = f{aa), (13) 

f{a) = a\og 2 (1 + ^S) . 

Note that both an and a^ satisfy the second constraint of 
(|6]l and ©. Therefore, ([T^ -dOll is the optimal solution of our 
optimization problems PI and P2. Based on (IT^ and (lOT l. we 
can distinguish between the following cases (Fig. Ell: 

Case 1: an > < 2 ^ 2 - this case, the optimal value an 
satisfies the interference constraints towards the Rp, ©, and 
at this value of an, the indicator function I(an) is zero, in 
which case R^^ = f(a^i). Hence, the CR system is capable 
to achieve the maximum possible rate (Fig. El)- 


For the various curves, we consider realistic values of 
the channel gain averages px, p-y, Pz, Pg (from to 

10-2) and PUI thresholds -ph (from -90dBm to -30dBm). The 
average achievable rate is calculated as: 

1 ^ 

where Ri is given by @- 

Fig. El presents the average achievable rate of the CR 
system in function of the average SU transmit power (Phe), 
expressed in dBm. The average SU transmit power reflects the 
actual energy harvested from Tp, and is calculated as: 


Phe = E 


1 - ai 


ai 


giPr 


M 




i=l 


ai 


The different curves are for the various combinations of 
Px, Py and pz- In the best case scenario, when a) the PU 































Fig. 3. Average achievable rate in dependance of the average harvested PU 
power (PUI thi'eshold = -90 dBm, Outage probability threshold = 1%, PU 
transmit power Pt = 30dBW) 

and SU transmitters are close to each other, and also b) both 
receivers experience small amount of interference (i.e. substan¬ 
tially far apart or/weak interference channels yi and Zi), then 
the CR system achieves practically useful information rates 
(black circles). The average achievable rate of the SU system 
decreases significantly when the receivers experience higher 
interference margins (blue rectangles and doted green lines). 
More specifically, the CR system achieves better performance 
when it is subjected to higher interference (blue line with 
rectangles) instead of causing high interference to the PU 
receiver (doted green line). This behavior is due to the nature 
of the underlay spectrum sharing method, which allows the 
SU to interfere with the PU system in only a fraction of 
the time (e = 1% in this case). This means that the energy 
harvesting CR system is more robust and favors scenarios 
where the pathloss between the SU transmitter and PU receiver 
is higher compared to the pathloss on the channel between the 
PU transmitter and the SU receiver. 

In the worst case scenario, the harvested energy is small 
(the PU and SU transmitters are far apart) and the interference 
channels yi and experience low path-loss, in which case, 
the secondary system achieves negligible information rates. 
This is an expected result, because for very low (i.e. minimal) 
amounts of harvested energy, the PU has a minimal budget of 
transmit power. Additionally, because of the low pathloss on 
the interference channels, the SU receiver suffers from high 
PU interference, resulting in practically useless information 
rates (dashed red and yellow lines). 

As discussed in the previous figure, the pathloss incurred on 
channel z crucially affects the average achievable rate. Fig. |4] 
depicts the average achievable rate in function of the average 
channel gain g {p,g), expressed in dB as 10Zopl0(/ig), while 
average channel gain of the channel z {pLz) appears as a curve 
parameter. For comparison purposes, the figure also presents 
the average achievable rate for a system with equal and fixed 
durations of the EH and IT phases (i.e., ai = 0.5, Vi). The 
figure also depicts the theoretical upper bound of the average 
achievable rate, which is obtained when the primary system 
constraint, expressed by O, is neglected from the optimization 



Fig. 4. Average achievable rate in dependance of the gain on channel g 
(PUI threshold = -90dBm, Outage probability threshold = \%, g.j. = 10“^, 
^ly = 10-U 



Fig. 5. Average achievable rate in dependance of the outage probability 
threshold (PUI threshold = -90dBm , /ly = 10“^, fix = 10“®, fiz = 10“®) 

problem (in this case, a* = 

It is evident that the parameter pz significantly affects the 
performance of the CR system. We also note that our proposed 
solution significantly outperforms the scenario with ai = 0.5, 
which justifies the optimal time resource allocation. Moreover, 
when the channel z is very week (e.g. pz = 10 “® or pz = 
10 “®), then the proposed solution achieves rates close to the 
upper bound for the case pz = 10 “®, and almost identical 
to the upper bound for the case = 10“®. Because of the 
high path-loss on the interference channel between the SU 
transmitter and PU receiver, the CR system can exploit the 
full potential of the harvested energy without causing harmful 
interference to the PU system while attain information rates 
close to the upper bound. 

Fig. |5] depicts the performance of the CR system in function 
of the PU outage probability threshold (e). We notice that 
the average achievable rate of the considered CR system 
depends on e and also on the amount of harvested RF energy. 
Intuitively, the average achievable rate of the CR system 
increases when the secondary system causes more frequent 
outages to the PU system. 

Fig. | 6 ] depicts the average achievable rate in function of the 
PUI threshold ( 74 / 1 ), expressed in dBm, for different values of 
the average SU transmit powers (Phe)- We notice that higher 
























Fig. 6. Average achievable rate in dependance of the PUI threshold (PU 
outage probability = 1%, = 10“^, fix = 10“®, /Tz = 10“®) 

rates are achieved for higher amounts of harvested energy. 
Additionally, the average achievable rate increases when the 
PU interference constraint is relaxed (i.e. with increasing jth)- 
However, the average achievable rate begins to saturate after 
some PUI threshold, which in turn heavily depends on the 
amount of harvested energy. Increasing Phe, the saturation 
point moves towards the higher PUI thresholds. These sat¬ 
uration points can play crucial role when dimensioning and 
optimizing the CR network. For example, the presented system 
model and optimization can be easily applied to the scenario 
of cognitive D2D communication system, where the SU is 
the D2D system, while the PU system is a mobile cellular 
system (consisting of a base and mobile stations). In this 
case it is vital for the D2D devices to achieve the highest 
possible information rate while considering the amount of 
harmful generated imposed on the PU (cellular) system. The 
saturation points of Fig. |6] gives an idea how to optimize the 
overall system by achieving the highest information rate in the 
SU system and low interference in the PU system. 

VI. Conclusion 

This work studies an underlay CR system with RF energy 
harvesting, and determines the optimal time resource alloca¬ 
tion of the energy harvesting and information transmission 
phases. The proposed optimal solution is an online solution 
that maximizes the average achievable rate of the CR system 
while limiting the harmful interference from the CR system 
to the PU system. The simulation results show that the CR 
system can attain practically useful throughput when subjected 
to the influence of a powerful primary transmitter. The solution 
always outperforms the case of fixed time allocation and in 
some cases comes very close to the information rates that can 
be achieved when the protection criterion constraint of the 
primary system is not considered. The proposed system and 
its optimal solution is applicable to future wireless networks, 
such as the cognitive D2D communication systems. 
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